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STATUS OF CLAIMS 
Claims 1 through 12 are pending in the above-identified patent 
application. Claims 1-4, 6-8, 10, and 11 remain rejected under 35 U.S.C. § 102(b) as 
being anticipated by Kupiec (United States Patent Number .5,696,962) and claims 5, 9, 
5 and 12 remain rejected under 35 U.S.C. § 103(a) as being unpatentable over Kupiec in 
view of Braden-Harder et al. (United States Patent Number 5,933,822). 

STATUS OF AMENDMENTS 
There have been no amendments filed subsequent to the non-final 

10 rejection. 

SUMMARY OF CLAIMED SUBJECT MATTER 
The present invention is directed to automated question answering that 
relates to the selection of an answer to a question from a pool of potential answers which 

15 are manually or automatically extracted from a large collection of textual documents 
(page 5, line 18, to page 7, line 17). The feature extraction component, a feature 
combination component, an answer selection component, and an answer presentation 
component, among others, are included (page 7, line 18, to page 9, line 6). The input to 
the system is a set of one or more natural language questions and a collection of textual 

20 documents (page 9, lines 15-22). The output is a (possibly ranked) set of factual answers 
to the questions, these answers being extracted from the document collection (page 9, line 
23, to page 10, line 5). 

In one exemplary embodiment, a method for selecting answers to natural 
language questions from a collection of textual documents is disclosed, comprising the 

25 steps of: extracting scoring features from a candidate list of passages of possible answers 
(page 5, line 24, to page 8, line 22); scoring the possible answers using the extracted 
features and a feature scoring function (page 7, line 18, to page 8, line 22); and presenting 
the best scoring possible answer to the user with context from the passage containing the 
answer (page 8, line 27, to page 10, line 5). 
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In another exemplary embodiment, a computer system that extracts 
answers to natural language questions from a large collection of textual documents is 
disclosed, consisting of one or more of the following modules: a feature extraction 
module; a feature combination module, containing a "feature extraction" and "compute 
5 composite score" components; an answer selection module; and an answer presentation 
module (page 7, line 18, to page 9, line 6). 

In one exemplary embodiment, a computer program product is disclosed 
that performs the steps of: determining a feature scoring function during a training phase 
either manually or via a machine learning algorithm applied to a set of training questions, 
10 corresponding answer passages, and certain extracted features (page 5, line 24, to page 8, 
line 26); and during an execution phase, extracting certain features from questions and 
corresponding possible answer phrases, applying the feature scoring function determined 
during the training phase to score each possible answers phrase (page 5, line 24, to page 
8, line 26), selecting one or more of the best scoring answer phrases (page 8, line 1, to 
15 page 10, line 5), and displaying the answer phrases to the user with optional additional 
context from the answer passages (page 8, line 27, to page 10, line 5). 

In another exemplary embodiment, the parameters of the scoring function 
are learned by a machine learning algorithm (pages 6-10). 

20 STATEMENT OF GROUNDS OF REJECTION TO BE REVIEWED ON APPEAL 

Claims 1-4, 6-8, 10, and 11 are rejected under 35 U.S.C. §102(b) as being 
anticipated by Kupiec and claims 5, 9, and 12 are rejected under 35 U.S.C. § 103(a) as 
being unpatentable over Kupiec in view of Braden-Harder et al. 

25 ARGUMENT 
Independent Claims 1, 7 and 12 

Independent claims 1 and 7 were rejected under 35 U.S.C. § 102(b) as 
being anticipated by Kupiec and claim 12 was rejected under 35 U.S.C. § 103(a) as being 
unpatentable over Kupiec in view of Braden-Harder et al. Regarding claims 1 and 7, the 
30 Examiner asserts that Kupiec discloses scoring the possible answers using the extracted 
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features and a feature scoring function (col 31, lines 60-63). Regarding claim 12, the 
Examiner asserts that Kupiec teaches scoring each possible answer phrase. 

Appellants note that the present invention is directed to selecting answers 
to natural language questions "from a pool of potential answers which are manually or 
5 automatically extracted from a large collection of textual documents." (Page 1, lines 9- 
11.) An "answer" is defined as "a statement (either spoken or written) that is made in 
reply to a question or request or criticism or accusation." (See, dictionary.com) Thus, 
answers are not documents but are short snippets or "snippets of text (e.g., phrases) 
which provide the exact answer to the question." (See, definition on page 1, lines 21-23, 

10 of the present disclosure.) 

Appellants also note that Kupiec teaches that "the relevant articles are 
heuristically scored according to the degree and number of matches with the phrases of 
the input question... After scoring, the relevant articles are ranked according to their 
scores ." (Col. 31, lines 34-41; emphasis added.) After the hypothetical answers are 

15 extracted, Kupiec teaches that "answer hypotheses are scored on a per-article basis 
according to the sum of the scores of the articles in which they occur'' (Col. 31, lines 
52-62; emphasis added.) Kupiec teaches that answer scores are based on the article 
scores; Kupiec does not disclose or suggest that answer scores are based on scoring 
features of possible answers. 

20 In addition, although Braden-Hader utilizes natural language processing to 

identify documents that are relevant to a query, Braden-Harder discloses that the 
processor "presents the retained documents to the user rank-ordered based on their 
score." (Col. 8, lines 2-4; emphasis added.) As with most conventional search engines, 
Braden-Harder simply ranks and returns documents. Braden-Harder does not attempt to 

25 score answers and present answers, as defined in the present invention and as would be 
understood by a person of ordinary skill in the art, and does not attempt to present 
answers with context from a relevant passage. Independent claim 1 requires extracting 
scoring features from a candidate list of passages of possible answers; scoring the 
possible answers using the extracted features and a feature scoring function; and 

30 presenting the best scoring possible answer to the user with context from the passage 
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containing the answer. Independent claim 7 requires a feature combination module, 
containing a "feature extraction' and "compute composite score" components', an 
answer selection module and an answer presentation module, and independent claim 12 
requires scoring "each possible answer phrase, selecting one or more of the best scoring 
5 answer phrases, and displaying the answer phrases to the user." 

Thus, Kupiec and Braden-Harder et al, alone or in combination, do not 
disclose or suggest extracting scoring features from a candidate list of passages of 
possible answers; scoring the possible answers using the extracted features and a feature 
scoring function; and presenting the best scoring possible answer to the user with context 

10 from the passage containing the answer, as required by claim 1, do not disclose or 
suggest a feature combination module, containing a "feature extraction" and "compute 
composite score" components; an answer selection module and an answer presentation 
module, as required by independent claim 7, and do not disclose or suggest scoring each 
possible answer phrase, selecting one or more of the best scoring answer phrases, and 

15 displaying the answer phrases to the user, as required by independent claim 12. 
Claims 5 and 9 

Claims 5 and 9 are rejected under 35 U.S.C. § 103(a) as being unpatentable 
over Kupiec in view of Braden-Harder et al. In particular, the Examiner acknowledges 
that Kupiec lacks teaching that "the parameters of the scoring function are learned by a 
20 machine learning algorithm," but asserts that Braden-Harder discloses this limitation 
(FIGS. 8A and 8B; col. 17, lines 16-67; col. 18, lines 1-24; col. 25, lines 41-48). 

Appellants note that the text cited by the Examiner is directed to "the 
manner through which our invention (Braden-Harder) compares and weights matching 
logical form triples, and ranks corresponding documents." (Col. 17, lines 5-7.) 
25 Appellants could find no disclosure or suggestion by Braden-Harder, however, that "the 
parameters of the scoring function are learned by a machine learning algorithm." 

Thus, Kupiec and Braden-Harder et al., alone or in combination, do not 
disclose or suggest that the parameters of the scoring function are learned by a machine 
learning algorithm, as required by claims 5 and 9. 

30 
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Conclusion 

The rejections of the cited claims under sections 102 and 103 in view of 
Kupiec and Braden-Harder et al., alone or in combination, are therefore believed to be 
improper and should be withdrawn. The remaining rejected dependent claims are 
believed allowable for at least the reasons identified above with respect to the 
independent claims. 

The attention of the Examiner and the Appeal Board to this matter is 

appreciated. 

Respectfully, 

Date: May 18, 2006 Kevin M. Mason 

Attorney for Applicant(s) 
15 Reg. No. 36,597 

Ryan, Mason & Lewis, LLP 
1300 Post Road, Suite 205 
Fairfield, CT 06824 
(203) 255-6560 
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APPENDIX 

1. A method for selecting answers to natural language questions from a 
collection of textual documents comprising the steps of: 

5 extracting scoring features from a candidate list of passages of possible 

answers; 

scoring the possible answers using the extracted features and a feature 
scoring function; and 

presenting the best scoring possible answer to the user with context from 
10 the passage containing the answer. 

2. A method as in claim 1, wherein the features used to score possible 
answers consists of one or more of the following features: a semantic type of a current 
suspected answer, a position of the suspected answer among all suspected answers within 

15 all document passages, a position of the suspected answer among all suspected answers 
within the given passage, a number of suspected answers of a given semantic type 
retrieved within a given passage, a number of words in a suspected answer that do not 
appear in the user question, a position of the semantic type in the list of potential 
semantic types for the question, an average distance in words between the beginning of 

20 the potential answer and the words in the question that also appear in the passage, a 
passage relevance as computed by the information retrieval engine, a frequency of a 
given potential answer on the list, a semantic relation between words from the question 
and words from the potential answer, and a strength score that is a function of the 
relevance score. 

25 

3. A method as in claim 2, wherein the feature scoring function is a linear 
combination of weighted features. 

4. A method as in claim 3, wherein the parameters of the scoring function 
30 are manually determined. 
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5. A method as in claim 3, wherein the parameters of the scoring function 
are learned by a machine learning algorithm. 

6. A method as in claim 1 where the candidate list of passages of possible 
5 answers is obtained from the collection of documents using an information retrieval 

engine 

7. A computer system that extracts answers to natural language questions 
from a large collection of textual documents consisting of one or more of the following 

10 modules: 

a feature extraction module; 

a feature combination module, containing a "feature extraction" and 
"compute composite score" components; 

an answer selection module; and 
1 5 an answer presentation module. 

8. A computer system, as in claim 7, wherein the feature extraction 
module extracts one or more of the following features: a semantic type of the current 
suspected answer, a position of the suspected answer among all suspected answers within 

20 all document passages, a position of the suspected answer among all suspected answers 
within the given passage, a number of suspected answers of a given semantic type 
retrieved within a given passage, a number of words in an suspected answer that do not 
appear in the user question, a position of the semantic type in the list of potential 
semantic types for the question, an average distance in words between the beginning of 

25 the potential answer and the words in the question that also appear in the passage, a 
passage relevance as computed the retrieval engine, a frequency of a given potential 
answer on the list, a semantic relation between words from the question and words from 
the potential answer, and a strength score that is a function of the relevance score. 
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9. A computer system as in claim 7, wherein the feature combination 
module employs a feature scoring function with parameters learned by a machine 
learning method. 

5 10. A computer system as in claim 7, wherein the answer selection module 

selects the answer with the best score obtained from the feature combination module. 

11. A computer system as in claim 7, wherein the answer presentation 
module shows the top scored answer within the context as specified by a user or a 

10 system. 

12. A computer program product that performs the steps of: 
determining a feature scoring function during a training phase either 

manually or via a machine learning algorithm applied to a set of training questions, 
15 corresponding answer passages, and certain extracted features; and 

during an execution phase, extracting certain features from questions and 
corresponding possible answer phrases, applying the feature scoring function determined 
during the training phase to score each possible answers phrase, selecting one or more of 
the best scoring answer phrases, and displaying the answer phrases to the user with 
20 optional additional context from the answer passages. 
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EVIDENCE APPENDIX 
There is no evidence submitted pursuant to § 1.130, 1.131, or 1.132 or 
entered by the Examiner and relied upon by appellant. 
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RELATED PROCEEDINGS APPENDIX 
There are no known decisions rendered by a court or the Board in any 
proceeding identified pursuant to paragraph (c)(l)(ii) of 37 CFR 41 .37. 
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